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l-Tevl, a double-strand DNA endonuclease encoded by 
the mobile td intron of phage T4, has specificity for the 
intronless td allele. Genetic and physical studies indicate 
that the enzyme makes extensive contacts with its DNA 
substrate over at least three helical turns and around the 
circumference of the helix. Remarkably, no single 
nucleotide within a 48 bp region encompassing this 
interaction domain is essential for cleavage. Although two 
sub domains (DI and DII) contain preferred sequences, 
a third domain (DUO, a primary region of contact with 
the enzyme, displays much lower sequence preference. 
While DO and Dili suffice for recognition and binding 
of I-7fevI, all three domains are important for formation 
of a cleavage-competent complex. Mutational, 
footprinting and interference studies indicate 
predominant interactions of l-Tevl across the minor 
groove and phosphate backbone of the DNA. Contacts 
appear not to be at the single nucleotide level; rather, 
redundant interactions and/or structural recognition are 
implied. These unusual properties provide a basis for 
understanding how l-Tevl recognizes T-even phage DNA, 
which is heavily modified in the major groove. These 
recognition characteristics may increase the range of 
natural substrates available to the endonuclease, thereby 
extending the invasive potential of the mobile intron. 
Key words: DNA— protein interaction/intron homing 
site/mobile intron/modified DNA/phage T4 



Introduction 

Intron-encoded double-strand endonucleases involved in 
intron mobility have been described in prokaryotes and lower 
eukaryotes (reviewed by Dujon, 1989; Lambowitz, 1989; 
Perlman and Butow, 1989; Belfort, 1991; Clyman et al, 
1993; Lambowitz and Belfort, 1993). These endonucleases 
share the ability to recognize and cleave their respective 
intronless alleles, thereby initiating recombination events that 
result in intron inheritance. This process is referred to as 
intron homing, with the horning site comprising the intron 
insertion site and the endonuclease binding and cleavage sites 
(Dujon et al., 1989). A distinctive feature of the prokaryotic 
endonucleases is that they cleave at a distance from the intron 



insertion site (Bell-Pedersen et al, 1990; Chu et al., 1990; 
Eddy and Gold, 1991). l-Tevl, the 29 kDa endonuclease 
encoded by the bacteriophage T4 td intron, binds an 
intronless td allele and cleaves 23-26 bp upstream of the 
intron insertion site (Bell-Pedersen et al, 1990, 1991; Chu 
etal., 1990). A further point of interest of the phage 
endonucleases is their ability to recognize and cleave both 
unmodified DNA and T-even phage DNA containing 
glucosylated hydroxymethylcytosine residues projecting into 
the major groove (Quirk et al., 1989; Bell-Pedersen, 1991). 

DNase I footprint analysis of the td homing site has shown 
that I-7e-vI makes high-affinity contacts with sequences 
flanking the intron insertion site, whereas protection extends 
to the cleavage site at high I-7evI concentrations (Bell- 
Pedersen etal., 1991). Although disruption of the intron 
insertion site by insertion or deletion abolishes I-7e-vI 
recognition, many missense mutations are tolerated 
throughout the homing site (Quirk etal., 1989; Bell- 
Pedersen etal., 1991; Chu etal., 1991; M.Bryk, 
unpublished). In contrast, l-Scel from the mitochondrial LSU 
rRNA intron of Saccharomyces cerevisiae tolerates only 
minimal degeneracy over an 18 bp region (Colleaux et al., 
1988). 1-SceH, from the aI4a intron of the COXI gene of 
S. cerevisiae, is less fastidious in recognition of its homing 
site than l-Scel, but also has distinct sequence preferences 
at a number of sites (Sargueil et al., 1990; Wernette et al., 
1992). 

To examine further the interaction of I-7evI with the td 
homing site, we are using a combination of genetic and 
physical approaches. With degenerate substrates and an 
enrichment scheme for isolating enzyme -substrate 
complexes, we have selected variants proficient in both 
binding and cleavage by l-Tevl. No single nucleotide within 
a 48 bp stretch containing the homing site is absolutely 
required for I-7J? vl activity. This sequence tolerance and the 
ability of l-Tevl to act on modified phage DNA support the 
findings of protection and modification interference studies 
indicating that l-Tevl binds predominantly across the minor 
groove in close proximity to the phosphate backbone. l-Tevl 
thus joins a small, growing list of proteins that interact largely 
across the minor groove of DNA and that may recognize 
structural properties of DNA rather than strictly sequence 
determinants. The evolutionary implications of such 
interactions are considered in terms of the ability of an 
endonuclease to impart and extend the invasive potential of 
a mobile intron. 

Results 

The sequence-tolerant td homing site contains 
hypomutable regions 

Previous studies indicated that a high degree of 
polyinorphism is tolerated in the td homing site (Quirk et al. , 
1989; BeU-Pedersen et al., 1991; Chu et al., 1991; M.Bryk, 
unpublished). To determine the sequence specificity of the 
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l-Tev\ -target interaction directly, an enrichment scheme was 
developed to isolate variant td homing sites that maintain 
the ability to bind \-Tev\ despite extensive sequence 
alterations (Figure IB). Mutant homing sites were prepared 
by synthesizing oligonucleotides that were contaminated I to 
15% with the other three nucleotides (Figure 1 A). 
Degeneracy of the mutant oligonucleotides was confirmed 
to be 15% by cloning and sequencing (Table IA, line 1, 
14.9%). The degenerate duplex pool was incubated witii 1- 
Tevl and separated by electrophoresis on native gels into 
three major species (Figure 1C): unbound DNA, a lower 
complex and an upper complex. For binding enrichment, 
the catalytically active upper complex was purified away 
from the inactive lower complex, which is an intermediate 
or by-product of the binding reaction (J.E.Mueller and 
M.Belfort, in preparation). Purified DNA from the upper 
complex was amplified by PCR and then used as the target 
for the subsequent round of binding. A doublmg in the 
fraction of bound DNA, from 16% to 30%, was observed 



after five enrichment cycles (Figure ID). Although the 
fraction bound did not reach wild-type levels (30% versus 
58%), the relative amount of the active upper complex 
increased over five cycles of binding enrichment from 43% 
to 100%, exceeding the fraction of upper complex for tne 
wild-type homing site (64%). 

DNA from the bound fraction from cycles 4 and 5 was 
cloned and mutants were sequenced and classified awording 
to their ability to be cleaved by I-r«4 (Figure £ TaWe 
Comparison of the compiled sequences (Figure 2B) of cloned 
variants with that of the wild-type td homing site revealed 
that 1-revI is, in general, extremely tolerant of base changes. 
In 62 cleavage-proficient variants (including two that were 
unselected), no single base within the 48 bp degenerate 
reeion was absolutely required for recognition by l-Tevl 
(Figure 2C). From Table 1, which contrasts mutation 
frequency in noncleavable (-) and cleavable (+/- to 
variants, it is apparent that the average mutation 
frequency among 60 enriched cleavage-proficient homing 
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Table I. Analysis of id homing site variants 
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•Cleavage proficiency was assayed as described in Materials and methods. N.t., not tested; 0% cleaved in 5 min; '+/-*, 5-50% cleaved in 5 
rain; ' + *, 40-50% cleaved in 1 min; * + + *, >50% cleaved in 1 min; ' + + + *, >90% cleaved in 1 min. (In three cases, where assignment to + 
or ++ classes was ambiguous, cleavage of &90% at 5 min was used to assign variants to the ++ class.) 
b Number of variants sequenced. 
c Average number of mutations per 48 bp. 

•"Mutation frequencies presented for: 'Entire' = 48-base degenerate region (bp -30 to +18); 'DI' = hypomutable domain in exon I (bp -21 to 
-14); 'DH' = hypomutable domain in exon D (bp +9 to + 15); 'DID' = (bp -8 to +8), defined on the basis of physical studies (BeU-Pedersen 
et al. f 1991); 'Other' = remaining 17 bases in degenerate region, collectively. The number of base pairs in each region is indicated in parentheses. 
«This set was analyzed statistically (Materials and methods). The breakdown of the cleavage-proficient variants from line 3 of part A is presented in 
part B. 

fWild-type id homing site is a member of the (+) class. 

sites was reduced from 14.9% to 9.8% (Table IA, lines 1 
and 3). In contrast, for the cleavage-minus variants, the 
mutation frequency over the 48 bp region was 20.5% 
(Table IA, line 2). These results suggest that, although 
binding is tolerant of sequence alterations, constraints exist 
for the formation of cleavage-competent complexes. 

To determine whether there are specific regions of the 
cleavage-proficient variants that are more sensitive to 
mutation than others, a two-stage statistical analysis was 
performed (Materials and methods). In the first, exploratory 
phase with 24 variants, two regions appeared conserved. 
These domains are DI in exon I (8 bp, -21 to -14) and 
DH in exon II (7 bp, +9 to + 15) (Figure 2B and C; mutation 
frequencies of 3.6% and 2.4%, respectively). In the second, 
confirmatory phase with 36 variants, the hypomutability of 
DI and DII was confirmed: the observed mutation 
frequencies, 4.9% and 4.0% respectively, are significantly 
lower than the input level of 15% (P < 0.05). The mutation 
frequencies of DI and DII for all 60 cleavage-competent 
variants are 4.4% and 3.3%, respectively (Table IA, line 
3). Of the 20 binding-proficient mutants that are not cleaved 
(Table IA, line 2), DH was conserved (mutation frequency 
of 9.3%; P < 0.05 for the significance of the difference 
from 15%), suggesting a requirement for DII in l-Tevl 
binding, while DI was not (mutation frequency of 16.9%), 
indicating a possible role for DI in the formation of the 
catalytically active complex. 

Surprisingly, the region flanking the intron insertion site, 
previously shown to constitute a portion of the high-affinity 
I-7evI binding site (Bell-Pedersen et a/., 1991), which we 
term Dm (16 bp, -8 to +8) (Figure 2C), was significantly 
more tolerant of mutation than DI and DO (P < 0.05) 
although, at a mutation frequency of 10.8%, it was conserved 
relative to the input degeneracy of 15% (P < 0.05; 
Table IA, line 3). The importance of this domain in 
generation of the cleavage-competent complex is further 
underscored by comparing the mutation frequency of DID 
for the cleavage-proficient variants (Table IA, line 3; 10.8%) 
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with the mutants unable to be cleaved (Table IA, line 2; 
25.9%). In contrast with DI, DH and Dm, the other 17 
positions in the 48 bp homing site ('Other*, Table I), which 
collectively had a mutation frequency of 13.9%, were 
unconsented (P > 0.05). 

By grouping variants into cleavage classes, it became clear 
that the mutation frequency over 48 bases decreased as 
cleavage efficiency increased (Figure 2B, Table IB). 
Although sample size within each class precludes a rigorous 
statistical analysis, it is interesting that the reduced tolerance 
for mutation is reflected in a steady decrease in mutation 
frequency for both DI and DII as cleavage ability increased 
(Table IB). In contrast, no steady decline was observed for 
DDI. Instead, the mutation frequency first dropped and then 
increased with increasing cleavage efficiency (Table IB, lines 
1-4). That is, for those variants that are cleaved more 
efficiently by l-7>vl than the wild-type id homing site (+ + 
and + + +), sequence constraints on DI and DH are 
heightened, whereas this is not the case for DEI. It remains 
to be determined whether the increased mutation frequency 
in DIH in the + + + class reflects a statistical fluctuation, 
or whether these mutations in Dm (Figure 2B and C) are 
responsible for the enhancement of function. 



Physical evidence for l-Tevl interactions at discrete 
domains 

To evaluate physical contacts made by l-Tevl at the homing 
site, hydroxyl radical protection experiments were performed 
with the active upper complex and inactive lower complex 
using Fe(H)EDTA reagent (Figure 3). The strongest region 
of hydroxyl radical protection flanks the intron insertion site. 
In a typical Fe(H)EDTA cleavage experiment, a second 
region of protection is evident in the upper complex, 
downstream from the I-7evI cleavage site (Figure 3A and 
B). Interestingly, I-7>vI does not protect the nucleotides that 
immediately flank the cleavage site on the top strand. 
Protection of the bottom strand in this region is unclear, 
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because it is obscured by cleavage of this strand by \-Tevl 
in the presence of Fe(E)EDTA reagents (Figure 3). 

In contrast to the upper complex, I-TevI protects only the 
region near the insertion site in the lower complex 
(Figure 3B), with no consistent protection apparent m the 
region extending towards the cleavage site. These results 
corroborate the genetic experiments in which Dl, near the 
cleavage site, was hypomutable only in the cleavage- 
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competent variants (Table I), implying that contacts near the 
cleavage site are important for the cleavage event. 

Inhibition of the i-Tevl-DNA interaction by phosphate 
ethylation 

Given the extensive interaction between I-7>vI and the td 
horning site and considering the sequence tolerance of the 
site, it became of interest to probe the involvement of the 
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phosphate backbone. The results of a typical ethylation 
interference experiment are shown in Figure 4. Ethylation 
of phosphates flanking the insertion site interferes with upper 
and lower complex formation (Figure 4A and B). In contrast, 
ethylation of phosphates extending towards the cleavage site 
interfere only with upper complex formation, paralleling the 
hydroxy 1 radical protection results (cf. Figure 3B and 4B). 

Chemical probing of major and minor groove 
interactions 

Dimethylsulfate (DMS) modification of purine residues was 
used to examine groove-specific interactions of I-7evI with 
the homing site (Figure 5). Interference patterns indicate 
that methylation of purines flanking the insertion site 
interferes with both upper and lower complex formation 
(Figure 5A). Again, purine modifications approaching the 
cleavage site interfere only with upper complex formation 
(Figure 5A 'TOP', cf. UC and LC lanes), consistent with 
the foregoing genetic and physical studies (Figures 2-4). 



As with ethylation, no modification completely eliminated 
binding. Results of numerous methylation experiments are 
summarized in Figure 5B to indicate the purines that interfere 
with 1-revI binding relative to free DNA (F lane) (•), and 
those that only interfere relative to the unbound DNA fraction 
(UB lane) (O). This latter fraction contains enhanced bands 
due to modified DNA molecules that are excluded from the 
bound fractions. Of the seven more strongly interfering 
nucleotides, six are A residues, with three As and one G 
in sequences flanking the insertion site, and three As in the 
region approaching the cleavage site (Figure 5 B). 
• Considering that DMS methylates adenine residues at N3 
in the minor groove and guanine residues at N7 in the major 
groove (Siebenlist and Gilbert, 1980), the methylation 
interference data suggest predoinmantly minor groove 
interactions around the intron insertion site and approaching 
the cleavage site of the upper complex. These inferences are 
confirmed by the other physical data, which are summarized 
in Figure 6 and will be considered further in the Discussion. 
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Discussion 

The genetic and physical analyses combine to underscore 
several distinctive features of the 1-Tevl -target interaction. 
Remarkably, the id homing site variants revealed that no 
individual base within 48 bp is absolutely required for the 
interaction (Figure 2). Furthermore, the interaction between 
l-Tevl and the homing site is extensive, spanning at least 
three helical turns and occupying the entire circumference 
of the helix (Figure 6). Additionally, the horning site can 
be divided into three domains, DI, Dfl and DID, based on 
genetic and physical criteria. DI (in exon I) and DII (in exon 
II) are domains in which sequence preferences are more 
clearly manifest than in Dm, flanking the insertion site. By 
physical criteria, DII and DIE form a contiguous region 
important in formation of both inactive and cleavage- 
competent I-revI-DNA complexes. By contrast, DI has 



been implicated specifically in formation of the active 
complex. Finally, there appear to be extensive interactions 
involving both the rninor groove and the phosphate 
backbone. . 

A broad body of evidence is accumulating in favor ol 
interaction of I-7VrvI predominantly across the minor groove. 
First, hydroxyl radical protection and ethylation interference 
are apparent across the minor groove both around the 
insertion site (DIE) and towards the cleavage site (DI) 
(summarized and most clearly apparent in Figure 6). Second, 
complementary strands at the insertion site (DIE) and the 
downstream rim of the cleavage site region pi) exhibit peaks 
of hydroxyl radical protection (Figures 3B and 6) as weU 
as ethylation interference (Figures 4B and 6) that are shifted 
by several bases in the 3' direction, supportive of minor 
groove interactions (Dervan, 1986). Third, direct evidence 
against major groove interactions upstream of the intron 



2146 



l-Tevl endonuclease - DNA interaction 



A TOP 

AG UC LC UB F 



BOTTOM 

AG UC LC UB F 




I 0 * °- 



o*~— * - 



IS > 



IS t> 



cs 



" cs is 

* • CO* oo ^ o o 

5 '- TA7CAACGCTCAG7AGA7GTTT7CTTGGGTCTACCGTTTAATATTGCG-3 ' 
3 '• ATA6TTGCGAGTCgTCTACAAAAGAACWA6ATGGCAAATTATAACGC-5* 

•25 20 -15 -10 -5 * «5 +10 *15 



Fig. 5. Methylation interference analysis. (A) Interference of \-Tevi 
interaction by DNA methylation. Labeling is as in Figure 4 except that 
F = free methylated DNA. Modified purine residues that interfered 
with l-7VvI binding over three experiments are marked: • , 
interference evident against F and UB; o, interference evident against 
UB only. (B) Schematic representation of interference of the UC. 

insertion site was provided by creating, by site-specific 
mutagenesis, a dam methylation site (GATC) within 2 bp 
of the intron insertion site (G-2A) (Figure 6A). Methylation 
at this site did not reduce cleavage efficiency, suggesting 
that I-7evI is insensitive to methylation of adenine in the 
major groove at -2 (M.Bryk, unpublished). Fourth, 
although the physical data do not support minor groove 
interactions exclusively, the minor groove seems dominant, 
based on the observation that there are more methylated As 
(six) than Gs (one) which interfere strongly with binding in 
both DI and DID (Figures 5B and 6). It remains unclear 
whether guanine interference is due to isolated major groove 
contacts near regions of minor groove interaction, or whether 
this results from indirect inhibitory effects on complex 
formation (Siebenlist and Gilbert, 1980; Yang and Nash, 
1989; Derbyshire and Grindley, 1992). 

Interaction across the minor groove and phosphate 
backbone, the least discriminatory determinants of DNA 
sequence (Seeman et al, 1976), is consistent with the relaxed 
specificity of l-Tevl (Figures 1 and 2). It isnot yet clear 
to what degree I-7>vI interaction involves discriminatory 
contacts within the minor groove and to what degree it 
depends on the structural properties of DNA dictated by 
nucleotide sequence. Nevertheless, interaction outside the 



major groove is in accord with the ability of the enzyme to 
cleave T-even phage DNA, which has bulky glucosyl groups 
protruding into the major groove from all hydroxy- 
methylcytosine residues. 

Both members of the prokaryotic HU /MF fa mily (Tanaka 
et al. y 1984; Yang and Nash, 1989) and TFJJD, a eukaryotic 
transcription accessory which interacts with the TATA box, 
bind primarily to the minor groove of their respective targets 
(Lee et al, 1991 ; Starr and Hawley, 1991). \-Tev\ contacts 
may resemble TFUD- and IHF-DNA interactions, 
although these latter proteins appear to be more sequence- 
specific than l-Tevl. However, like l-Tevl, IHF and TFTJD 
envelop the helix. Interestingly, IHF and TFJJD share a 
common structural motif, the antiparallel jS-ribbon, capable 
of forming arms that can encircle the DNA (Tanaka et al., 
1984; Nash and Granston, 1992). Likewise, the N-tenninal 
region of I-7fcvI, which contains a conserved sequence motif 
(GIY...Y1G; Michel and Dujon, 1986) likely to be part of 
the enzyme's active site, is predicted by Chou-Fasman and 
GOR analyses (Devereux et al., 1984) to form a 0-ribbon 
(M.Bryk, unpublished). Experiments in progress will address 
the functional relevance of the common features in these 
systems. 

The incomplete interference caused by base modification 
within the homing site (Figures 4 and 5) and the high 
tolerance of I-7cvI for sequence polymorphisms in its 
substrate (Figure 2) are likely to be related phenomena, 
which may be explained in one of two ways. One 
interpretation is that the enzyme makes redundant contacts 
with its DNA target, consistent with the failure to completely 
inhibit binding by modification of any particular nucleotide 
[Figures 4 and 5 and hydroxyl radical interference (missing 
nucleotide) analysis, J.E.Mueller, unpublished]. Addition- 
ally, we have isolated a series of single- and double-mutant 
pairs (C+4A, A + 11C and C+4A:A+11C) where the 
single mutants have barely detectable phenotypes but the 
double mutant is highly cleavage-defective (M.Bryk, 
unpublished). These observations may suggest that l-Tevl 
interacts with both residues +4 and +11, and that either 
contact will maintain function but eliminating both 
determinants seriously impairs the enzyme -DNA 
interaction. The alternative interpretation is that l-Tevl 
recognizes structural features of the DNA. Thus binding 
interference might be partial if more than one base 
modification is required to disrupt a structural element. 
Similarly, the structural component may be preserved in the 
single mutants (C+4A and A + 11C), but disrupted in the 
corresponding double mutant (C+4A:A+11C). 

In the context of DNA structure, it is informative to 
examine the hydroxyl radical cleavage data. Hydroxyl 
radicals are sensitive probes for structural distortion of DNA 
in the minor groove (Burkhoff and Tullius, 1987). The 
hydroxyl radical cleavage pattern on free td homing site 
DNA reproducibly indicates that the top-strand bases at the 
intron insertion site are more susceptible to cleavage than 
neighboring residues, suggesting a structural perturbation 
in Dm (Figure 3A, lanes F). Also, the A/T-rich regions of 
the homing site (bp -14 to -5 and bp +7 to + 15) are 
resistant to DNase I cleavage in the absence of l-Tevl 
compared with other regions of the homing site (Bell- 
Pedersen et al, 1991), further suggesting local structural 
anomalies, such as differences in the width of the minor 
groove (Drew and Travers, 1985). The latter DNase I- 
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resistant region includes the hypomutable DD domain. 
Methylation, ethylation and hydroxyl radical interference 
(missing nucleotide) analyses (Figure 6 and data not shown) 
indicate that DII is an important recognition element, despite 
only partial contact detected by hydroxyl radical protection 
analysis. A possible explanation for incomplete protection 
in the face of modification interference is that DO assumes 
a recognition structure that is disrupted by modification. 
There are precedents for non-contacted bases influencing 
formation of protein -DNA complexes, including the P22 
and lambda repressor -operator interactions (Koudelka 
et al t 1987, 1988; Wu et al, 1992) and SU(Hw) protein 
interaction with specific sequences of the Drosophila gypsy 
element (Spana and Corces, 1990), It is of further interest 
that the 7 bp DII is composed entirely of A and T residues 
and that poly(dA/dT) tracts are structurally distinct from B- 
DNA, having both deeper and narrower minor grooves 
(BurkhofT and Tullius, 1987). 

Modified DNAs are resistant to restriction endonucleases 
when the modified nucleotide is within the recognition 
sequence of the enzyme. This is true for adenine-methylated 
as well as glucosylated hydroxymethylcytosine DNAs. 
Unlike restriction enzymes, which make base-specific 
contacts predominantly in the major groove (for example, 
EcoRI; reviewed in Jen-Jacobson etal., 1991), I-7>vI 
cleaves modified DNAs very efficiently and with equivalent 
specificity to unmodified substrates (BeU-Pedersen et al., 
1991; this work). Clearly, sequence-tolerant recognition 
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based on interactions across the minor groove and phosphate 
backbone and possibly involving structural features of the 
DNA extends the range of substrates that are susceptible to 
cleavage by this intron endonuclease. These recognition 
properties would increase the potential of the intron-encoded 
endonuclease to act on a variety of substrates in the wild, 
thereby extending the invasive properties and potential for 
dissemination of the mobile intron. 



Materials and methods 

Oligonucleotides . , 

Mutant td homing sites were selected from a degenerate oligonucleoude 
pool (NBS25) synthesized with phosphoramidite mixtures contanunatedto 
15% with each of the other three bases at 48 positions (Figure 1 A^ The 
PCR primers used to amplify the degenerate td homing sites were NBS26 
(5'-AAAGGATCCGAACTAACGTAGCATATGA-3 ) and NBS27 
(5 ' - AA AG AATTCT ATTTGGATTTGC A GTGG-3 ') , which contain 
tenninal BomHl and EcoK sites, respectively. Physical analysis of the 1- 
7*vI-DNA interaction was performed with a synthetic 80 bp td hormng- 
site duplex formed with P^Hed oligywl^ide OP46 
(S'-AATTCGGATTTGCAGTGGTATCAACGCTCAGJTAGA 
TCTTGGGTCTACCGTTTAATATTGCGTCATATGCTACGTTAGT- 

TCG-3') and its complement OP47 (5'^ATCCGAACTAACGTAGCA- 
TATGACGCAATATTAAACGGTAGACCCAAGAAAACATCTACTG- 

AGCGTTGATACCACTGCAAATCCG-3 ') . 

i-Tevl preparations . 
h Tevi was partially purified from Escherichia cob (BeU-Pedersen et al. t 
1991) or synthesized in vitro in wheat germ extracts (Promega, Madison, 
WD (BeU-Pedersen et at., 1989). One unit of activity is denned as the amount 
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of enzyme required to cleave 250 ng (78 fmol) of linearized target ( - 5 
kb) to 50% completion in 1 min at 37 °C. Physical analyses and the binding 
enrichment were performed with more highly purified in vivo preparations 
(0.1 -0.25 U7jd; 0.25-0.3 /ig/jtl total protein). Cleavage assays to classify 
td homing-site mutants utilized the more active in vitro I-7Vvl preparations 
(5-10 Wul; 100 n&tul total protein). 

Enrichment for functional td homing site mutants 
Homing-site variants enriched for the ability to bind I-7>vl were isolated 
by a series of sequential I-Tfvl binding and PCR amplification steps. The 
degenerate oligonucleotide NBS25 (25 pmol) (Figure 1) was amplified using 
a Perkin Elmer -Cetus (Norwalk, CT) PCR kit with 32 P-labeled primers 
NBS26 and NBS27. Double-stranded DNA was eluted from native 12% 
gels (29:1, aciylaniide:bisacrylarnide in 90 mM Tris- borate, 2 mM EDTA 
pH 8.0 (TBE)] in 0.5 M ammonium acetate, extracted with 
phenol/ chloroform/isoamyl alcohol and ethanol-precipitated. The degenerate 
pool of duplex homing sites (10-15 pmol) was then incubated with 1 .0- 1 .5 
U of l-Tev\ in binding buffer (50 mM Tris -HQ pH 8.0, 20 uglmi 
poly(dI/dC), 10 /ig/ml BSAJ for 5 min. TBE dye mix (TBE containing 0.01 % 
bromophenol blue, 0.01 % xylene cyanol, 50% glycerol) was added to 1/10 
vol and I-7VvI-DNA complexes were isolated from native gels, as above. 
The enriched pools from cycles 4 and 5 were cloned into pTZl 8U (United 
States Biochemical). Mutant td homing sites were sequenced using the M 13 
universal primer (5 '-GTAAAACGACGGCCAGT-3 ' ) . 

Statistical methods 

A two-stage sampling procedure (Tukey, 1977) was used to identify 
conserved regions within the functional homing-site variants. In the first 
stage, the exploratory phase, 24 variants were examined and two 
hypomutable regions, DI (8 bp, -21 to - 14) and DD (7 bp, +9 to + 15), 
were identified on the basis of lower mutation frequency (Figure 2C). In 
the second stage, the confirmatory phase, 36 variants were examined, where 
the normal approximation to the binomial was employed to compare the 
level of mutation in DI and DD with the input frequency of 15%. To assess 
variation in DID (16 bp, -8 to +8), a region defined by prior physical 
analysis, the observed mutation frequency of all 60 cleavage-proficient 
variants was compared with the 15% input degeneracy using the normal 
approximation to the binomial. To test the hypothesis thai DI and DD were 
more conserved than DID, a x 2 analysis comparing the mutation 
frequencies in DI and DD versus DID was used. 

Activity assays 

Sail -linearized pTZl 8U containing either the wild-type or variant td horning 
sites (200 ng; -3.0 kb) were incubated with 0.5 U I-7<rvI at 37 C C in assay 
buffer (50 mM Tris -HQ pH 8.0, 100 mM NaCl, 10 mM MgCl 2 ). At 
1 or 5 min, the reaction was stopped by adding stop-load buffer (0.05 mM 
EDTA, 5% SDS, 25% glycerol, 0.25% bromophenol blue) and samples 
were electrophoreses in 1% agarose gels in TAE (40 mM Tris -acetate, 
1 mM EDTA). The extent of cleavage was estimated by densitometry (Hoefer 
GS-300) with negative films of ethidium bromide-stained gels. 

Oligonucleotide labeling and duplex formation for physical 
analyses 

Non-purified oligonucleotides (800 pmol) were end-labeled with 
(7 J2 PJATP and T4 polynucleotide kinase. Full-length oligonucleotides 
were purified from denaturing (8.3 M urea) 10% TBE gels (19:1, 
acrylamide: bisacrylamide). Purified 32 P-labeled oligonucleotides (-100 
pmol) were annealed to 200 pmol of their non-labeled HPLC-purified 
complements by heating to 90*C for 5 min and slow cooling. 

Hydroxy! radical protection analyses 

Hydroxy! radical footprinting was done according to Tuliius and Dombroski 
(1985) with the following modifications. hTev] (30 fil, 3.0-7.5 U) was 
incubated with 20 pmol duplex in binding buffer for 1 min at 23 °C. The 
mixture was then adjusted to 10 mM L-ascorbic acid, 0.15% fyOj, 0.1 
mM (NH^FefSO^ ^HjO, 0.2 mM EDTA. After 1 min the reaction was 
quenched with 16 n\ TBE dye mix. Free DNA and bound complexes were 
isolated from native gels and separated on denaturing gels. Dried gels were 
exposed to X-ray film, which was analyzed by densitometry. 

Ethylation and methylation interference 

End-labeled DNA was modified by treatment with ethylnitrosourea or 
dirnethylsuliate as described by Siebenlist and Gilbert (1980). Modified DNA 
(1 pmol) in binding buffer was incubated with 0.05-0.1 U I-7>vi for 2 
min at 23 *C. I-7>vI-DNA complexes, unbound DNA and free DNA were 
eluted and cleaved at modified positions with NaOH (Hendrickson and 
Schleif, 1985), separated on 10% denaturing gels and analyzed by 
densiiometry. 
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